Discrete Mathematics 110 (1992) 43-59 
North-Holland 



43 



Universal cycles for combinatorial 
structures 

Fan Chung 

Bell Communications Research, Morristown, NJ 07960, USA 

Persi Diaconis 

Harvard University, Cambridge, MA 02139, USA 

Ron Graham 

AT&T Bell Laboratories, Murray Hill, NJ 07974, USA 

Received 2 July 1990 
Revised 12 April 1991 

Abstract 

Chung, F., P. Diaconis and R. Graham, Universal cycles for combinatorial structures. Discrete 
Mathematics 110 (1992) 43-59 

In this paper, we explore generalizations of de Bruijn cycles for a variety of families of 
combinatorial structures, including permutations, partitions and subsets of a finite set. 

1. Introduction 

The cyclic sequence C of 16 O's and I's shown in Fig. 0 has the following 
unlikely property. If we list each of the 16 possible blocks of 4 consecutive 
symbols of C, it turns out that they are all different. As a consequence, it follows 
that every possible 0-1 sequence of length 4 occurs this way (uniquely). The cycle 
C is an example of what has come to be known as a de Bruijn cycle. More 
generally, a (binary) de Bruijn cycle C„ of order n is defined to be a cyclic 
sequence {xq, Xi, . . . , jcj^-i) where jc, = 0 or 1, and each possible binary sequence 
of length n occurs uniquely as (jc,+i, ...» JC,+„) for some i, where index addition is 
performed modulo 2". The study of such cycles has had a long and distinguished 
history, and has arisen in a variety of contexts, such as design of Sanskrit memory 
wheels, digital fault testing, pseudo-random number generation, modern public- 
key cryptographic schemes, and even for use by illusionists in various mind- 
reading effects, to mention a few. (For an overview of this history, and indeed, 
the whole topic of de Bruijn cycles, the reader can consult [1, 5, 21, 14]. 
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Fig. 0. A de Bniijn cycle of order 4. 



Among the fundamental questions one might ask concerning de Bruijn cycles 

are: 

(i) Do de Bruijn cycles always exist for each n? 

(ii) If so, how many are there? 

(iii) How does one construct them? 

(iv) In a given de Bruijn cycle C, is there an easy way of determining the tth 
block as a function of /? 

(v) How can one 'invert' this process in C. That is, for each given block, where 
is it in C? 

(vi) How can one 'cut down' a de Bruijn cycle C. That is, when is it possible 
to remove elements from C so that the resulting contracted cycle C still has 
distinct blocks of length n (although some now will be missing). In the same spirit, 
how can one 'build up' or 'combine' de Bruijn cycles? 

(vii) What are the analogues for larger alphabets {k symbols rather than 2), or 
more dimensions (e.g., a de Bruijn 'torus' rather than a cycle), etc. 

We will summarize some of the known answers to some of these questions in 
Section 3. 

The thrust of this paper will be to consider the analogous situation for a variety 
of other combinatorial structures, rather than binary M-tuples. In particular, we 
will outline what is known for permutations of an n-set (Section 4), partitions of 
an n-set (Section 5), and A:-sets of an n-set (Section 6). In Section 2, we formulate 
our problem in a general setting, and in Section 3, we interpret de Bruijn cycles 
in this formulation. Finally, in Section 7, we describe possible future directions. 

2. A general fonnulation 

We begin by being given some family ^„ of combinatorial objects of 'rankn'. 
We denote their number by m := \^„\. We assume that each F e S'„is 'generated' 
or specified by some sequence (xi,...,x„), where Jc.eA, for some fixed 
alphabet A. 
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We will say that U = (oo, aj, . . . , a^-i) is a universal cycle for ^ (or (7-cycle, 
for short) if (fl,+i, . . . , a,+«), 0=si <m, runs through each element of ^„ exactly 
once, where index addition is performed modulo n. 

Now we can ask the standard questions: do [/-cycles for ^„ exist, if so how 
many, how do you construct them, invert them, combine them, extend them, etc. 
Of course, it is clear that some [/-cycles might be better than others for some of 
these purposes. When this is so, how do we find 'good' ones. 

In addition to their inherent combinatorial interest, one might also ask how one 
might use these [/-cycles. 



3. de Bruijn cycles 

We next sketch the standard approach used for treating de Bruijn cycles. In 
this case, 

^„ = B„ = {0, 1}" = {(jci, . . . , x„) I X, e {0, 1}, l^i^n}, m = 2" 
and each binary n-tuple (xi, . . . , x„) is just represented by itself, i.e., 
{xu ■ ■ • ,X„)<^(Xu . . . ,x„). 

(This will not be the case in most of the later situations.) 

The first step in constructing potential [/-cycles for B„ is to construct the 
(directed) transition graph G„ for B„. The vertices of G„ are all the n-tuples 
{0,1}". There is a directed edge (=arc) from (xi,...,x„) to iyi,---,yn) 
provided X2 = y\, Xt,= y2, . . . , x„ = y„-i. Thus, arcs look like 
{(xi, . . . , x„), (^2, . . . ,x„, j:„+i)). What this indicates is that it is possible to go 
from {Xi, . . . , x„) to {x2, ■ ■ ■ , x„+i) in a potential [/-cycle, namely, when the 
block . . . Xi, X2, ■ ■ ■ , x„, x„+i . . . occurs. 

We illustrate the graphs G2 and G3 in Fig. 1. 

From this point of view, a [/-cycle for B„ corresponds exactly to a directed 

circuit in G„ going through each vertex exactly once, i.e., a Hamiltonian circuit 
for G„. This is both good news and bad news. The good news is that our problem 
has been reduced to finding a very familiar object in graph theory, namely, 
Hamiltonian circuits. The bad news is that these objects are well known to be 
difficult to find! In fact, it is an NP-complete problem to decide if a graph in 
general even has a Hamiltonian circuit. 

Fortunately, we have a way around this problem in this case. What we can do is 
to define another digraph G*, called the arc digraph of G„, as follows. The 
vertices of G* will just be the arcs of G„. In particular the arc 
{{xx, . . . , x„_i, jc„), {x2, . . . , x„, jc„+i)) will correspond to the vertex labelled 
with the (n - l)-tuple (x2, . . . ,x„) in G*. The arcs of G* will be all pairs of 
vertices ((^i, . . . , y„_i) (y2, .... y„)) in G*, i.e., so that the 'head' of the first 
vertex label is equal to the 'tail' of the second vertex label. In Fig. 2, we show G| 
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Fig. 1. The graphs G2 and G3. 

and G3. It is clear now that a Hamiltonian circuit in G„ corresponds exactly to an 
'Eulerian' circuit in G*, i.e., a (directed) circuit passing through each arc exactly 
once. The advantage of this transformation is that Eulerian circuits in digraphs 
are easy to detect. To state this precisely, let us call a digraph G balanced if for 
every vertex v of G, indegree(v) = outdegree(w). Also, call G strongly connected 
if for any vertices u and v of G, there is a directed path in G from u to v. 




Fig. 2. The arc digraphs GJ and G%. 
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Fact. G has an Eulerian circuit if and only if G is balanced and strongly 
connected. 

It is not difficult to see that G* is balanced and strongly connected, and so is 
Eulerian. This in turn shows that G„ is Hamiltonian, i.e., has a U -cycle. Notice 
that G* is isomorphic to G„_,. A more careful analysis shows that in fact G* has 
exactly 2^""" different Eulerian cycles. For a good discussion of this topic as well 
as various generalizations such as fc-symbol alphabets, the reader is referred to 
[14, 15, 18]. 

In the next three sections we will attempt to apply the same analysis (with 
decreasing success) to permutations, partitions and k-set of an n-set, respectively. 

4. Permutatioos 

Let us denote by 5„ the set of all n\ permutations (or arrangments) of 
{1, 2, ... , n}. If a = (a,, fl2. • • • . ^n) and b = {by, ■ ■ . , b„) each are ^-tuples 
of distinct integers we will say that d and b are order-isomorphic, written d~b, if 

ai<aj <^ bi<bj. 

A f/-cycle t/„ = (ao, «i, • • • . e {1, 2, . . . , N), for 5„ will be n!-tuple 

such that each ae5„ is order-isomorphic to exactly one block (fl,+i, .... a,+„), 
where, of course, index addition is performed modulo n\ It is clear why we must 
in general take N>n since blocks of length n must always consist of n distinct 
symbols. An example of f/-cycle for is 

1 4 5 2 4 3. 

To begin the process of constructing {/-cycles of S„ we imitate the analysis used 
for de Bruijn cycles and construct the transition graph G„ for S„. We illustrate this 
for iV = 3 in Fig. 3. 

The arcs of G„ are defined as follows. Suppose (for n = 3) we have the 

sequence ■ ■ ■ A52x ■ ■ ■ where we are suppressing commas. Now 452 — 231. The 
next 3-block 52jc could have three possibilities. If a: = 1 then 521 ~ 321 so that we 
get the arc 231^321. If j: = 3 then 523-312 and we have the arc 231 -» 312. 




Fig. 3. G3. 
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Fig. 4. G3. 



Finally, if jc = 6 then wc have 526 — 213 and 231^213. So, even after we find a 
Hamiltonian cycle in G„, we still have to assign values a, to realize (order- 
isomorphically) the appropriate elements of 5„. We will have more to say about 
this latter. The structure of G3 can be simplified if we regroup the vertices as in 
Fig. 4. 

We have grouped permutations according to the order type of the first two 
elements, which are '12' and '21'. An arc in G3 from 213, for example to the 
group '12' denotes that there are really three arcs, one from 213 to each of the 
elements 123, 132 and 231 in the group '12'. Since each permutation now has 
exactly one arc leaving it, it suffices to find an Eulerian circuit in G3 in order to 
produce a Hamiltonian circuit in G3. We show such an Eulerian circuit for G3 in 
Fig. 5. The corresponding Hamiltonian circuit in G3 is 

Cr^32-^ 312 — - 123 — ^ 231 — - 321 — ii^-"213— ^ 
The key question is now this. How does such a cycle correspond to a f/-cycle 



Suppose we assign (as of yet) undetermined values for the potential (/-cycle as 



for S3? 



follows: 



U: a h c d e f. 




Fig. 5. An Eulerian circuit in G3. 
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Fig. 6. P3. 

We want the first 3-block abc to be order-isomorphic to the first permutation 132 
in our Hamiltonian circuit, i.e., abc — 132 which just means a<c<b. Similarly, 
we want bed ~ 312 which implies c<d<b, cde ~ 123 which impUes c<d<e, 
etc. 

We can represent the implied inequalities among a, b, . . . , f hy means of a 
partial order (which itself is just an acyclic digraph), where will denote the 
requirement that i < j. We show this partial order P3 in Fig. 6. 

What we now require is a mapping of {a, b, . . . ,f} into {1, 2, ... , N} which 
preserves order, i.e., a linear extension A of P3 into {1, 2, ... , A^} for a suitable 
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Fig. 7. The clustered transition graph G4 for S^. 
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N. In particular, it is natural to make N as small as possible (so that in particular 
the mapping should be onto). In this case, we can choose N = 4 and take 
A(a) = 1, A(c) = A(/) = 2, = 3, = = 4, which results in the (/-cycle 
142342 for 53. 

In Fig. 7 we show the 'clustered' transition graph G4 for ^4. A particularly nice 
Eulerian circuit for G4 is given in Fig. 8. 

If we assume that U4 = abc - • ■ a; is a {/-cycle which realizes this ordering of ^4 
then we can construct as we did for ^3 the implied partial order P4 (shown in Fig. 
8). This we show in Fig. 9. 

The main point is that P4 has height (= length of longest chain) 5. Thus, we can 
define the Unear extension X:{a, . . . , x}^ {1, 2, 3, 4, 5} by A(z) := length of 
longest chain ending in z, to produce the f/-cycle 

12341253415321453241325 4. 

In general, we can cluster vertices of the transition graph G„ to form G„ (by 
grouping together those n permutations for which the initial (« - 1) blocks are 
order-isomorphic), which is easily checked to be balanced and strongly con- 
nected, and hence Eulerian. It is shown in Hurlbert [10] that by appropriately 

abed 




Fig. 8. An Eulerian circuit for G4. 
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^'1 I 1 1 1 

1 2 3 4 5 

Fig. 9. A linear extension. 

restricting C, the implied ordering on the values in the 'lifted' f/-cycle is in fact a 
partial order P„ = P„{C), i.e., has no cycles. (In fact, we believe this to be the 
case for any Eulerian circuit C) If h(P„) denotes the height of P„ then there is a 
linear extension of P„ into {1,2,..., h{P„)}, and consequently there is a t/ cycle 
for S„ from symbols in {1, 2, ... , h(P„)}. 

Suppose we define N(n) := mine h{P„(C)) where C ranges over all Eulerian 
circuits in G„. Then any t/-cycle for S„ must use at least N(«) different symbols. 
The best bounds we currently have for N(n) are 

N{2) = 2, yV(3) = 4, N(4) = 5 and n + l^N{n)^6n for n^5. 

However, we believe the following. 

Conjecture. N(ri) = n + 1, n > 3. 

We close this section with several questions. How many [/-cycles for 5„ are 
there with exactly N(n) different vertices? What about with at most N{n) + c 
entries for a fixed constant c? Exponentially many? Can we find t/-cycles which 
are easy to invert? Suppose we just want a specified subset Xc.Sn to be 
represented by f/„. For which X is this possible? 
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Fig. 10. Partitions of {1,2,3,4}, C/4:a6c6ccccddcdeec. 



5. Partitions 



The next class of objects we consider is the set of P„ of partitions of the 
n-element set {1,2, . . . The number of such partitions is just Et=i {2}> 
where {1} denotes the Stirhng number of the second kind, and satisfies the 
recurrence 

(e.g., see [8]). 

How will we represent partitions? We will do the following. We illustrate the 
idea for « = 8. A t/-cycle for P„ will be a sequence composed of symbols from the 
set A = {a, b, c, . . .]. A block, for example, abacbccd, will represent a 
partition, in this case 13 | 25 | 467 ] 8, by putting / and ; in the same group of the 
partition if and only if the ith and yth symbols of the block are the same. In Fig. 
10, we list the 15 partitions of {1, 2, 3, 4} and a (/-cycle for P4. 

We can proceed in the canonical way in searching for {/-cycles by first 
considering the corresponding transition graph G„. In Fig. 11(a) we show G3. In 
Fig. 11(b) we redraw G3 by clustering certain partitions together as shown, to 
form G3. 




Fig. 11. The graphs G3 and G3. 
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? £/4 : xi X2 X4 xs 



2 3 

1 2 I 3 

1 3 I 2 

ll 2I 3 
1 I 2 3 
Fig. 12. 



We use the same convention as in the preceding section, namely, an arc from 
a partition jt to a cluster means that arcs go from n to all partitions of the custer. 
This reduced graph G3 is Eulerian, with the only Eulerian circuit being 



The final step is to 'lift' this circuit to an actual f/-cycle by assigning appropriate 
symbols in order to realize the corresponding pcurtitions. We show the set-up in 



However, we now get a contradiction since we can deduce x^i^x^ =x^ = x^. 
Thus, we have an example of a Hamiltonian circuit in G„ which cannot be 'lifted' 
to a {/-cycle. In fact, there are no {/-cycles for P^. 

Undaunted, we move on to P^. In Fig. 13, we show G4. 

As before, if we imagine contracting clusters to points, this graph is Eulerian. 
The reader may wish to test his or her understanding up to this point by finding 
an Eulerian circuit in and extending it to a {/-cycle for P4 (there is more than 
one way to do this). 

For the general case of P„, this procedure works quite well. It is not difficult to 
see that the clustered graph G„ is always Eulerian (for n > 3). The only problem 
we have to worry about is that some Eulerian circuits might not be able to be 
converted to {/-cycles. This can only happen if the implied (in)equalities in the 
symbols of the {/-cycle end up with forcing jc ¥=jc for some symbol x (as happened 
for n = 3). To prevent this, it is enough to require that a specific sequence W of 
partitions occur in the Eulerian circuit C. The purpose of W is to prevent a 
sequence of equalities (or unequalities) from going across the corresponding 
portion of the {/-cycle. For example, take n = 4 and let W be 

1 I 23 I 4, 12 I 34, 1|234, 1234, 123 | 4. 

When this portion of C is 'lifted' we get the situation shown in Fig. 14. Thus, we 
must have 




12 I 3 — > 13 I 2 — > 1 I 2 I 3 — > 1 I 23 



Fig. 12. 



',+2 = «l+3 =^ Oi+S = di+b = fl/+7 a/+8- 



We can think of W as a 'breaker' since if r ss / and * 3= t -I- 8 then neither — a, 
nor Ur^Os can be forced. In particular, if C has a 'breaker' which does not 
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Fig. 13. The reduced graph G4. 



include 1 | 2 | 3 | 4 then C can always be lifted to a (/-cycle. It is not difficult to 
show that fern 5= 4 this can always be done. 

It is amusing to note that there are exactly 52 partitions of {1,2,3,4,5}. In 
fact, a f/-cycle for P5 can be constructed with the alphabet A = {D, C, H, S, J} so 
that the symbol J occurs just once, and each of the other symbols occur at most 
13 times. For example, one such cycle is 

DDDDDCHHHCCDDCCCHCHCSHHSDSSDSSHSDDCHSSCHSHDHSCHSJCDC. 

U4 : ... a. + i ai+2 ai+3 0,^.4 a.+s a,+« a,+7 a.+g -• 

1 I 2 3 I 4 

1 2 I 3 4 

1 I 2 3 4 

12 3 4 

I 2 3l 4 



Fig. 14. 
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In partitular, this cycle can be realized with an ordinary deck of playing cards 
with one spade (= S) replaced by a joker (=7). It is not hard to see that for P„, 
we must have an alphabet \A\^n. For N^n, how many [/-cycles for P„ are there 
with \A\ = N1 How do you invert any of these (/-cycles? 

6. it-Sets of an n-set 

The final class of objects we consider is the family ["] of all /:-element subsets 
(= A: -sets) of an n-element set {0, 1, . . . , n — 1). As an example of a t/-cycle for 
this situation, we have for n = 8, k = 3, the following cycle U: 

02456145712361246703671345034601250135672560234723570147. 

A distinguishing feature of this situation is that each 3-set might occur in any of 6 
possible orders in U, but it is only allowed to occur once. That is, since the first 
3-block 024 represents the 3-set {0, 2, 4} then none of the five other 3-blocks 042, 
204, 240, 402 and 420 can occur in U. One consequence of this fact is that we 
cannot even define a transition graph G for [?]! For if {1, 2, 3} is represented by 
the block 123, for example, then the arc {1, 2, 3}— » {2, 3, 4} is possible in G (by 
having the block continue 1234- • •). However, if {1,2,3} is represented by 213 
then {1, 2, 3}-^ {2, 3, 4} cannot be an arc in G. Since we do not know which way 
{1, 2, 3} will be represented then we cannot give a meaningful definition of G. 

There is a simple modular condition which is necessary for the existence of 
(/-cycles for [*]. 

Fact. // [l\ has a U-cycle then k divides (21 1). 

Proof. Consider a fixed symbol «, =x in a (/-cycle C. Since all symbols 
Ui+j, —k <j <k, must be distinct from x, then each copy of x occurs in exactly k 
A:-blocks of C. Since these /:-blocks represent A;-sets of {0, . . . , n - 1} which 
contain x, and there are exactly (* 1 1) if these, the conclusion follows. □ 

It is easy to see that (/-cycles exist for [2] whenever this necessary condition is 
satisfied, i.e., n is odd. 

It has been shown by Jackson [12] that this necessary condition is in fact 
sufficient for = 3 if n is large enough. 

Theorem [12], U-cydes exist for [3], n > 8, provided (" 2 ') = 0 (mod 3). 

Idea of proof. We illustrate the idea for n = 8. We first tabulate all possible 
different ways of selecting 3 elements from an 8-cycle where we identify two 

choices if they only differ by a rotation. We describe these by their sequences of 
differences between consecutive elements (modulo 8) (see Fig. 15). We next 
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PATTERN DIFFERENCES fmod 8) 

o - 
o - 

o - 
o - 

o 

o 

Fig. 15. Possible cyclic patterns for 3-sets of an 8-set. 

select for each (ordered) pattern two of the three differences (underlined in Fig. 

15). 

Now we construct a digraph G with vertices labeled by 1, 2 and 3, and arcs 
from /■ to if ij is an (ordered) pair of differences selected in the previous stage. 
We show G in Fig. 16. 

For the next step we look for an Eulerian circuit C in G. In this case we take 

U 2 2 1 1 3 3 1 

Finally we check that the sum S of the elements of C is relatively prime to « = 8. 
Since E = 5 in this case, then this stage passes. If we have managed to succeed up 
to this point then we can now construct our iZ-cycle U as follows. We take the 
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3 

Fig. 16. The graph for 3-sets of an 8-set. 

'template' of differences 22113 31 formed by C repeated 8 times, and construct 
the sequence of length 7 ■ 8 = 56 having these differences (mod 8) between 
consecutive elements. (It does not matter what the first element of U is). Thus, U 
(starting with 0) is 

A: 221133122113312 ••■ 
U: 0245614571236124 

What Jackson shows is that it is always possible to construct a t/-cycle for ["] 
this way, provided 3 | (" 2 '), i.e., n^O (mod 3), and « > 8. 
These techniques can be extended to show the following. 

Theorem. U-cycles exist for [l] provided (" j') = 0(mod4), (n, 4) = 1 and n is 

sufficiently large. 

It has very recently been shown by Hurlbert [10] that the necessary condition 
(" J ') = 0 (mod 6) is also sufficient for the existence of [/-cycles for [%\. However, 
for A: = 5 or /c ^ 7 we are still completely baffled. 

We are willing to make the following conjecture though. 

Conjecture ($100). [/-cycles exist for [%] always exist provided k divides (Itll) 
and n > no(^). 



7. Future directions 

There are of course many other combinatorial structures for which these and 
similar questions can be raised. Thus include, for example, permutations with 
ties, ordered A:-sets of an n-set, A: -sets of an ^-element multi-set, /r-dimensional 
subspaces of an n-dimensional vector space over G¥{q), combinatorial A:-spaces 



58 



F. Chung et al. 



0 0 10 
.00014 
'0111 

10 11 

Fig. 17. A de Bruijn torus for 2 x 2 arrays. 

of an «-space (a la Hales- Jewett; see [9]), etc. One could also ask for 
higher-dimensional analogues of these questions. For example, is it always 
possible to construct a universal torus T for every 2A:-by-2A: binary array? In other 
words, we are asking for a (square) 2^^-by-2^' binary array T, with horizonal and 
vertical sides, respectively, identified, so that ail 2k-by-2k binary arrays occur in T 
exactly once. The simplest example of such a T is shown in Fig. 17. In fact, such 
T always exist (see [6]) although their number for each size is not known. 

Non-square toruses have been investigated in [2-4,7,11,16,18-19], and in 
particular in [20], where they arise in connection with robot self-location 
problems. 

Clearly we have barely scratched the surface of this subject, with the vast bulk 
of the interesting results remaining yet to be discovered. An excellent start in 
some of these directions can be found in [10]. 
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